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Abstract 



In this paper we show that the closure of a random sample for a /c-dimensional random 
vector is almost surely a deterministic set of all heavy points of the distribution. A heavy 
point is defined to be a point for which all its neighborhoods have positive probability. 
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1 Introduction 

Although a random variable is deterministic and random by definition, one can 
find deterministic features specially when an infinite random sample is available. A 
deterministic feature can be an event with probability 1. Kolmogorov 0-1 Lemma 
(See pQ for example) provides a general result for finding events with probability 1. 
One can also look for random sets that are almost surely deterministic. This paper 
finds one of the simplest of such almost sure deterministic sets. We show that the 
closure of a random sample for a fc-dimensional random vector is almost surely the 
deterministic set of all heavy points of the distribution. A heavy point is defined to 
be a point for which all its neighborhoods have positive probability. 

2 Closure of random samples 

Suppose (Q, S, P) a probability space and X : Q random vector with 

respect to the Borel sigma-algebra on M fc with distribution function Fx- We begin 
by a definition. 

Definition 2.1: x G M. k is called a heavy point of a distribution function Fx if 



P(X eU)>0, 
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2 Closure of random samples 



if U is an open set and includes x. We call the set of all such points the heavy set of 
X and denote it by H(X). Also we call the N(X) = R k - H(X) the weightless set 
or null set of X. It is clear that these sets only depend on the distribution function. 

In the following, we prove a lemma about the properties of such sets. 

Lemma 2.1: Suppose X and Fx are defined as above. Then 

a) N(X) is open. 

b) H(X) is closed. 

c) P(X G N(X)) = 0. 

Proof We will denote an open ball of radius e > around x by Ba(x, e). 

a) For every x G N(X) by definition of H(X) there exist and e > such that 

P(X G Ba(x,e)) = 0. 

But then it is obvious that any y G Ba(x, e) also belongs to N(X) (simply find e' 
such that Ba(y, e') C Ba(x, e)) and the proof is complete. 

b) Straight forward corollary of a) 

c) Let Q' = Q n N(X) = {q u q 2 , ■ ■ ■ } and 

r[ = supP(X G Bafar)) = 0. 

r>0 

r' 

Such r\ is positive since G N[X). Then let r\ = -j- < r\. We have P(X G 
Ba(qi, Tj)) = 0. Let 

iV = U^ 1J Ba(g,,r l )ciV(X). 

Since N is a countable union of sets with zero probability, it's probability is also 
zero. It only remains to show that N(X) C N. Take x G N(X) then there exists 
r > such that 

P(X G Ba(x,r)) = 0. 

Take a rational number in Ba(x,r) such that |rr — q\ < ^ then q = qi E Q' and 
Ba(qi, C Ba(x,r) and hence 

, 3r 3r r 

Hence x G Ba(q i: ri) C N. ■ 
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Before to prove the main theorem we point out an important property of 
closed subsets of M. k . A topological space is called separable if it posses a countable 
dense subset. It is well-known that any topological space with a countable basis is 
separable. See [2] page 49 for example. 

Lemma 2.2: Every closed subset of M fc is separable. 

Proof We offer two proofs one based on the properties of separable spaces men- 
tioned above and an elementary proof. Suppose E is closed in M fc . 
Proof 1. E admits a countable basis consisting of all intervals with rational endpoints 
in E. Then the proof is complete since every topological space with a countable ba- 
sis is separable. 

Proof 2. Consider the /c-dimensional rational space Q k and let 

Q n = {Ba(q,l/n), q G Q k }, n = 1, 2, - - - 

and for every n we define D n as follows. If E D Ba(q, 1/n) is non-empty choose 
one element of this set x q ,\/ n and put in D n . Hence D n is countable and so is 
D = \J™ =1 D n . We also claim D is dense in E. Suppose e£E then it is obvious that 
for every e > there exist q, n such that 

Ba(q,l/n) C Ba(x,e) 

but by construction we picked an element of E from Ba(q, 1/n). ■ 

Lemma 2.3: Suppose X a /c-dim random vector, x G H(X) and X\, X 2 , • • • a random 
sample from X. Let E = {Xi, X 2 , • • • } then x 6 E, a.s.. 

Proof Let P(X G Ba(x, 1/n)) = p n > and 

A n = {oo G fi|3i, Xi(u) e Ba(x,l/n)}. 

Then 

A = n°l x Ai = {ou G Q\3i, Xi(u) G Ba(x, 1/n), n = l,2,---}. 
We need to show P(A) = 1. 



P(A n ) = P(3i, ^ G So(ar, 1/n)) = 1 - P(Vz, X ^ Bo(z, 1/n)) = 
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1 - n p ( x * ^ b ( x > i / n )) = i - (i - Pn)°° =1-0. 



i=l 



Now P(A) = lim n _ >00 P(A n ) = 1. 



Theorem 2.1: Suppose X 1 , X 2 , ■ ■ ■ be a random sample from X a k- dimensional ran- 
dom vector. Let E = {Xi, X 2 , ■ ■ ■} and suppose E is the closure of E with respect 
to the Euclidean distance topology of R h . Then E = ff (X), almost surely. 

Proof (i) ff(X) C E, a.s.: Since ff (X) is closed, it is separable and take a dense 
countable subset of ff (X), 

if' = {/*!, /l 2 ,...}. 

Let Q = {uj G G P} and C = D^Cj. Then P(Cj) = 1 by the previous lemma 
for each i = 1,2, • • • . But then P(C) = 1 which means ff C P, a.s. and hence 
H = WcE. 

(ii) P C H(X), almost surely. We only need to show E C ff (X), almost surely. 
But we showed P(X* G ff (X)) = 1 - P(X G X(X)) = 1-0, i = 1, 2, • • • . Let 
Ai = {uj G fi|X G ff (X)} and A = n^iA- Then A = P C ff (X) and 

P(A) = lim P(A n ) = 1. 
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